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METHODS FOR PRODUCING AND ANALYZING BIOPOLYMER I.ADDERS 



10 



BACKGROUND OF THE INVENTION 
The present invention relates to the field of 
biochemistry. In particular, it relates to methods of 

20 creating biopolymer ladders useful for obtaining structural 
information about the biopolymer, among other things. 

The analysis of the structure of various biopolymers 
is an area of great importance in biochemistry. Molecular 
genetics depends on a knowledge of the nucleotide sequence of 

25 DNA or RNA molecules. The amino acid sequence of proteins 
provides information useful for studying their function and 
for isolating the nucleic acids that encode them. There is an 
increasing recognition of the importance of post-translational 
protein modifications in the coordination and control of 

30 biological processes at the molecular level. In particular, 
many cellular processes, including cell division, are 
controlled by selective and reversible phosphorylation of 
proteins. More recently, as carbohydrates have proven to be 
of importance in various biological functions, their structure 

35 has become a focus of attention. Various artificial polymers 
are also being developed to mimic the function of the natural 
biopolymers . 
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Various strategies exist for analyzing the structure 
of biopolymers . The most commonly used method of determining 
the sequence of nucleic acids, the dideoxy method, involves 
creating four sets of sub- sequences of a DNA molecule that 
5 terminate at each of the four bases, separating the fragments 
by polyacrylamide gel electrophoresis (PAGE) , and using 
autoradiography to readout the sequence. By contrast, 
analysis of carbohydrate structure involves a laborious 
process of repetitively cleaving monosaccharides from a 

10 polysaccharide, and determining the identity of each 
monosaccharide removed. 

A commonly used method of peptide sec[uencing, Edmcui 
degradation, involves sequentially cleaving amino acids from 
the amino -terminus of a peptide, and then determining the 

15 identity of the amino acid cleaved. However, traditional 
Edman degradation gives no information concerning post- 
translational modification of polypeptides. Additionally, 
Edman degradation cannot be used for C- terminal sequencing. 
Such information is crucial for correlating gene structure and 

20 protein function. 

Peptide sequencing has also been attempted using 
exopeptidases to remove terminal amino acids from a 
polypeptide. Such methods involve removing samples of the 
polypeptide during the time course of digestion, terminating 

25 the action of the exopeptidase, and analyzing the resulting 

collection of samples. However, confusion can result from the 
widely varying rates at which the enzyme cleaves the peptide 
bond between different amino acids, which can cause gaps in 
the collection of peptides produced. 

30 

SUMMARY OF THE INVENTION 

The present invention provides, for example, methods 
for producing a biopolymer ladder. In a first step, an 
ensemble of biopolymer molecules is contacted with a 
35 terminating reagent that reacts with the biopolymer molecules 
to add a terminating moiety, and with a cleavage reagent that 
catalyzes the removal of terminal ruiits from* the end of a 
biopolymer. According to one aspect of the invention, the 
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rate of removing terminal units from biopolymer molecules, 
bearing a terminating moiety is less than the rate of removing 
terminal units from biopolymer molecules that do not bear a 
terminating moiety. The biopolymers are contacted with the 
5 reagents under conditions and in amoxints so as to create a 
collection of biopolymers comprising a nested set. 

According to certain embodiments of this invention, 
the biopolymer is a polypeptide, the cleavage reagent is a 
carboxypeptidase, the terminating reagent is an amino acid 

10 amide, and the rate at which amino acids are removed from the 
polypeptide is faster than the rate at which the amino acid 
amide is added to the polypeptide. In certain embodiments, 
the cleavage reagent is thermostable. 

This invention also provides biopolymer ladders 

15 comprising a nested set of at least two and preferably at 

least three biopolymer molecules wherein the members bear the 
same terminating moiety added catalytically . 

This invention also provides methods of obtaining 
information about the structure of a biopolymer involving 

20 preparing a biopolymer ladder according to the methods of this 
invention, determining the differences in mass between the 
members of the ladder, and determining the identity of units 
whose masses correlate with the determined mass differences. 
According to one embodiment of the invention, the biopolymer 

25 is a polypeptide, the cleavage reagent cleaves a C- terminal or 
N- terminal peptide bond of polypeptides, thereby removing a 
terminal amino acid, and the method produces information about 
the sequence of amino acids in the polypeptide. According to 
another embodiment of this invention, the method of 

30 determining mass differences involves subjecting the 
biopolymer ladder to mass spectrometry analysis. 



BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 illustrates the overall process of one 
35 embodiment of the invention; 

Fig. 2 illustrates a matrix-assisted laser 
desorption/ionization ("MALDI") time-of -flight mass 
spectrometer ; 
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Fig. 3 shows the MALDI time-of~ flight ("TOF") mass 
spectrum of synthetic porcine renin substrate; 

Fig. 4 shows the matrix- assisted laser 
desorpt ion/ ionization mass spectrometric ( "MALDI-MS" ) spectrum 
5 of a peptide ladder of renin substrate tetradecapeptide made 
according to the methods of this invention; and 

Fig. 5 shows a sequencing spectrum of synthetic 
porcine renin sxabstrate subjected to carboxypeptidase 
digestion but without a terminating reagent. 

10 

DETAXIiED DESCRIPTION OF THE INVENTION 

This invention provides a method for the generation 
of biopolymer ladders via controlled digestion of' a formed 
polypeptide using a catalytic agent- Biopolymer ladders are 

15 useful for obtaining structural information about the parent 
biopolymer. In one embodiment of the invention, the 
biopolymer ladder is composed of nested fragments of a parent 
biopolymer (optionally including the parent) that differ in 
length by one monomer unit each. These ladders are useful for 

20 determining the sequence of units in the parent biopolymer by, 
for example, determining the difference in mass between 
adjacent members of the ladder. The difference corresponds 
to, and aids in the identification of, the unit removed to 
produce the shorter fragment. 
' 25 The methods of this invention involve setting up 

competing reactions directed to the terminus of the biopolymer 
molecules- One reaction is the removal of terminal units from 
the biopolymer molecules by a cleavage reagent, such as the 
hydrolysis of a terminal amino acid from a peptide by an 

30 exopeptidase. The other reaction is the reaction of the 

biopolymer molecules with a terminating reagent that results 
in addition of a terminating moiety to the biopolymer. The 
reactions are selected so that the rate for removing a 
terminal xinit or units from biopolymers bearing a terminating 

35 moiety is slower than the rate for removing a terminal unit or 
units from biopolymer molecules that do not have terminating 
moieties. Thus, once a terminating moiety hats been added to a 
biopolymer by reaction with the terminating reagent, the 
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modified terminal unit is essentially refractory to cleavage 
by the cleavage reagent. 

One embodiment of this invention takes advantage of 
the fact that catalysts lower the activation energy for 
5 reactions in both directions, bringing the system to an 

eq[uilibrium defined by the relative concentration and chemical 
potentials of the reactants. In this embodiment, the 
terminating reagent is added to the system in molar excess, so 
that the equilibrium of the reaction of adding or removing the 

10 terminating moiety to or from the biopolymer is significantly 
in the direction of adding the terminating moiety. When the 
system is properly optimized, for any particular fragment size 
a terminating moiety is added to some fraction of -the 
biopolymers and a terminal unit or units are cleaved from some 

15 other fraction. Those biopolymer fragments that are capped 
with a terminating moiety tend to resist further cleavage. 
Those that are not capped will be available for cleavage and 
subsec[uent capping. By allowing this reaction to proceed, a 
set of truncated fragments of the starting biopolymer is 

20 formed with the members of the set differing in composition by 
the same number of monomer units. 

As used herein, the term "biopolymer" means a 
polymer made of a sequence of biochemical monomer units. 
Biopolymers include, for example, peptides, nucleic acids, 

25 complex carbohydrates, including analogs of these biopolymers, 
as well as other products of the sequential addition of 
biochemical monomer units. This includes, for example, 
peptides containing D- or L-amino acids, amino acid analogs 
and peptidomimetics that induce alpha helices, beta- or gamma- 

30 turns or other structural elements and nucleic acids 
containing nucleotide analogs. 

. As used herein, the term "biopolymer ladder" means a 
• nested set of at least two and preferably, at least three, 

biopolymer molecules. As used herein, a "nested set" means a 

35 set in which the sequence of units of each member in the 
nested set, except, of course, the largest member, is 
contained in the sequence of a larger member « in the set. The 
difference in size between any members of a nested set 
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normally equals the number of terminal units removed in a 
single cleavage reaction- When the set contains at least 
three members/ and the size of each member in the set differs 
from the next largest member by the same number of terminal 
\mits. For example, a biopolymer ladder created from a 
biopolymer with the sequence A-B-C-D-E-F-G-H-I, could include, 
for example, the nested set of biopolymers: A-B-C-D-E-F-G-H-* , 
A-B-C-D-E-F-G-*/ A-B-C-D-E-F-*, A-B-C-D-E«*, A-B-C-D-*, and 
A-B-C-*. ("*" indicates a terminating moiety.) In this set, 
the biopolymers differ in length by one biopolymer unit. 
Another example of a biopolymer ladder created by the action 
of a cleavage reagent that removes two terminal units at a 
time from the biopolymer could be, for example, 
A-B-C-D-E-F-G-*, A-B-C-D~E-* and A-B-C-*. In this set, the 
biopolymers differ in length by two units. In the case of a 
branched biopolymer, such as a complex carbohydrate, a 
sequence is contained in another sequence if the structure of 
the larger branched structure is the same as the smaller 
branched structure except for at least one terminal unit, 
e.g., a terminal sugar moiety. 

As used herein, the term "cleavage reagent" means a 
reagent containing a catalytic activity capable of catalyzing 
the removal of monomer units from a biopolymer. In one 
embodiment of the invention, the cleavage reagent is an 
enzyme. Other cleavage reagents include, for example, acids, 
bases, or naturally occurring products. The cleavage reagent 
can catalyze the removal of a single \init or multiple units 
from the end of the biopolymer. 

When the biopolymer is a peptide the cleavage 
reagent can include, for example, exopeptidases, i.e., 
carboxypeptidases or aminopeptidases. Useful exopeptidases 
include mono-peptidases and poly-peptidases, such as di- 
peptidases and tri -peptidases . This invention contemplates, 
in particular, the use of carboxypeptidase Y, carboxypeptidase 
P, carboxypeptidase A and carboxypeptidase B. It also 
contemplates the use of aminopeptidases, such as leucine 
aminopeptidase, microsomal peptidase, and so « forth. 
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Certain peptidases catalyze the breaking of the 
peptide bond between certain amino acids faster than the 
peptide bond between other amino acids. For example, 
carboxypeptidase A cleaves the peptide bond between a 
5 carboxyl terminal proline and other amino acids (i.e., NHa-AAi- 
. . .-AA^-Pro-COOH) much more slowly than the peptide bond 
between other amino acids at the carboxyterminus . Therefore, 
in certain embodiments of the invention, the cleavage reagent 
can be a cocktail, including more than one peptidase, each 

10 with different specificities so that the rate of cleavage of 
peptide bonds is more uniform than the rate using any one of 
the cleavage reagents alone. One example of this is the use 
of a mixture of carboxypeptidase A and carboxypeptidase Y to 
produce a peptide ladder in which all the members of the 

15 nested set of fragments are represented. 

Proteins and other biopolymers can have secondary or 
tertiary structure that leaves the terminal residues 
unaccessible to the cleavage reagent. Accordingly, this 
invention provides means and methods for exposing the termini 

20 of biopolymer molecules to allow the cleavage reagent access 
to the termini, for example, by denaturing the biopolymer 
molecules. One embodiment of this invention involves 
contacting the ensemble of biopolymer molecules with a 
thermostable cleavage reagent. As used herein, the term 

25 "thermostable cleavage reagent" refers to a cleavage reagent 
having an optimal activity at a temperature above 50° C. In 
this method, the ensemble of biopolymer molecules is contacted 
with the cleavage reagent at a temperature that will at least 
partially denature the biopolymer molecule, for example by 

30 disrupting secondary and/or tertiary structure. For many 

proteins, one can adequately denature the molecule so as to 
expose a terminus by exposing the molecules to temperatures 
above about 70^ C- Normally, reactions will be carried out 
below about 100° C. However, the temperature can be chosen 

35 with reference to the optimal reaction temperature of the 

cleavage reagent. For example, a reaction temperature of 90° 
C may be preferable in the case of a thermostable enzyme whose 
optimal reaction temperature is 90° C. 
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Thennostc±>le enzymes such as those found in 
thermophilic bacteria are useful in this invention. Several 
thermostable carboxypeptidases have been identified in and 
isolated from thermophilic bacteria. For example, 
carboxypeptidase Taq, found in Thermus aqaatlcus YT-1, has an 
optimum reaction temperature of 80** C. (S.-H. Lee et al. 
(1994) Biosci. Biotech. Blochem. 58 : 1490-1495 • ) Thermostable 
aminopeptidases have been identified in and isolated from 
Bacillus stearot±Lermophllu8 (Y. Sakamoto (1994) Bloscl, 
Biotech. Blochem. 58:1675-1678); Streptococcus thermophilus 
(M.P. Chapot char tier (1994) Eur. t7. Blochem. 224:497-506); 
Streptococcus sallvarius eubsp, thermophilus (amino peptidase 
N) (R.G. Midwinter (1994) J. Appl. Bacterlol., 77:288-295); 
and Sulfolohus solfatarlcus (M. Hanner et al. (1990) Blochem. 
et Blophys. Acta 1033:148-153). 

. It will be clear to anyone skilled in the art that 
other cleavage reagents having desirable properties can be 
isolated from organisms living in extreme environments and 
used in the methods of this invention. For example, salt- 
tolerant cleavage reagents can be isolated from halophilic 
bacteria or archeabacteria . Salt -tolerant cleavage reagents 
are useful for preparing biopolymer ladders from biopolymers, 
especially proteins, that are denatured under high ionic 
strength conditions. 

In the case of nucleic acids, the cleavage reagent 
can include, for example, 3' or 5' exonucleases such as Bal-31 
nuclease, E. coll exonuclease I, E. coll exonuclease VII, E. 
coll DNA polymerase I exonuclease activity, the exonuclease 
activity associated with Klenow fragment of DNA polymerase I, 
an exonuclease activity associated with T4 coliphage DNA 
polymerase, and so on. 

In the case of carbohydrates, the cleavage reagent 
can include, for example, exoglycosidases such as a variety of 
fucosidases, mannosidases, galactosidases and so forth. 

As used herein, the term "terminating reagent" means 
a reagent that, upon reaction with a biopolymer, adds a 
"terminating moiety" to the biopolymer that &lows or prevents 
the reaction of the cleavage reagent with the biopolymer. 
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That is, the reaction for removing a terminal unit or units 
from a biopolymer bearing a terminating moiety is slower than 
the reaction rate for. removing a terminal unit or units from a 
biopolymer molecule that has not reacted with a terminating 
5 moiety. When the cleavage reagent is an enzyme, the maximum 

velocity for removing tearminal units from a biopolymer bearing 
a terminating moiety is slower than the maximum velocity for 
removing a terminal unit from a biopolymer not having a 
terminating moiety. 

10 In one embodiment of this invention, the cleavage 

reagent catalyzes both the cleavage of terminal units and the 
addition of the terminating moiety, e.g., when the cleavage 
reagent is am enzyme. In this case, the terminating reagent 
can be selected to have a structure sufficiently different 

15 from the original units of the biopolymer that the rate 

constant for adding/removing the terminating reagent is slower 
that for adding/removing the original units. According to a 
preferred embodiment of the invention, the rate at which non- 
terminated units are removed from the biopolymer molecules is 

20 greater than the rate at which terminating moieties are added 
to terminal units of biopolymer molecules - 

For example, in the case of peptides in which the 
cleavage reagent includes a carboxypeptidase, the terminating 
reagent can be an amino acid amide. L-lysinamide has at least 

25 two desirable properties as a terminating reagent for use in 
peptide ladder generation. First, L-lysinamide is highly 
water soluble. This allows its use at high molar 
concentration. Secondly, L-lysinamide contains a basic moiety 
and, therefore, may enhance peptide ladder detection when the 

30 means to be used for analyzing the peptide is MALDI mass 
spectrometry because basic groups can aid in ionization. 
Because the amino acid amides have amide groups in place of 
carboxyl groups, carboxypeptidases catalyze their 
addition/removal from the carboxy- terminus of a peptide at 

35 rates several orders of magnitude less than ordinary amino 
acids. For example, carboxypeptidase Y catalyzes the 
hydrolysis of C-terminal peptide bonds of peptides at rates up 
to three orders of magnitude greater than for terminal amides 
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when comparing the kcat/KM for the hydrolysis of FA-Phe-Leu-OH 
and FA-Phe-Gly-NHj (Raaschou-Nielsen et al . (1994) Peptide 
Research 7 :132-135) - Thus, for all practical purposes, once 
an amino acid amide is added to the carboxy-terminus of a 
peptide, the carboxypeptidase will not remove it as it 
continues to cleave terminal amino acids from other peptide 
molecules . 

When the cleavage reagent is an amino peptidase, the 
terminating agent can be an n-acetyl amino acid, such as n- 
acetyl lysine . 

In a variation of these methods, the terminating 
reagent can include a mixture of isotopes . Biopolymer ladders 
containing such mixtures are useful in detecting members of 
the ladder. For example, members of the ladder bearing 
termination moieties that differ in mass can be "identified by 
split peaks in mass spectrometry {W,R. Gray (1970) Biochem. 
Biqphys. Res. Commun. 41:1111-1119} . Members of ladders 
bearing radioactively labelled terminating moieties also can 
be identified by autoradiography. 

Methods of producing biopolymer ladders involve 
optimizing the reaction system by using appropriate amounts of 
the reagents and selecting reaction conditions so as to form a 
ladder. This involves using a sufficient amount of 
terminating reagent so as to produce a detectable amount of 
terminated biopolymer at each secjuential cleavage step. This 
amount depends, of course, on the sensitivity of the detection 
means selected. Typically, the terminating reagent is added 
in molar excess so as to significantly drive the reaction in 
the direction of adding terminating moieties. According to 
one embodiment, terminating reagent is added in an amount so 
that several percent of the biopolymers are terminated in each 
reaction. A range of useful amounts of ladder members can 
easily be determined empirically. 

Other factors that can be modulated to set 
appropriate conditions include temperature, pH, and buffer 
system. Generally speaking, when the cleavage reagent is an 
enzyme, conditions under which the enzyme would normally be 
used will provide a good starting point for further optimizing 
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of reaction conditions. Additionally, in the case that a 
cocktail of enzymatic cleavage reagents is to be used/ it is 
useful to select compatible enzymes. Taking pH as an example, 
in many cases it will be useful to select enzymes which have 
5 similar activity profiles as a function of pH, however there 
may be cases in which it is desirable to use an enzymatic 
cleavage reagent at less than its optimal pH in order to 
control substrate preference effects. 

The biopolymers subjected to the methods of this 

10 invention can have a variety of sizes. However, the impact of 
the average uncertainty in measured mass becomes more 
significant as the molecular mass increases. Below 3500 
daltons, the mass deviation is less than ±0.3 dalton, and 
there is no ambiguity in distinguishing even the most closely 

15 related pairs of monomers and, in particular, amino acids 

(Leu/Ile have identical mass) . However, above 3500 daltons, 
uncertainties of 0.4 to 0.9 daltoi. introduce certain 
ambiguities in the identification of monomers of closely 
similar residue masses. For polypeptides that do not contain 

20 any amino acids having similar mass to other amino acids, this 
iincertainty will cause no problem at all. Even if there is 
some ambiguity in the identity of certain monomers in the 
polymer, the method of the invention still provides useful 
sequence information. Accordingly, in certain embodiments of 

25 the invention, the biopolymer has mass below about 5 kDa and, 
preferably, below about 3.5 kDa. When the biopolymer is a 
polypeptide, the polypeptide can have fewer than about 50 
amino acids and, preferably, fewer than aO^out 35 amino acids. 
Valuable sequence information can be obtained from 

30 polypeptides having larger sizes, as well. 

This invention also provides the biopolymer ladders 
produced by the methods of this invention. These ladders are 
characterized by a collection of biopolymers having at least 
two, and preferably three members in a nested set wherein the 

35 members bear the same terminating moiety. In particular, this 
invention is directed to ladders having a catalytically added 
terminating moiety such as, in the case of peptides, an amino 
acid amide such as lysinamide. 
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Biopolymer ladders can yield useful structural 
information about the biopolymer^ including for example, the 
sequence of monomer units. When the cleavage reagent cleaves 
one unit from the biopolymer at a time, the members of the 
ladder will differ in mass by the mass of the unit removed, 
thereby providing information aibout the identity of the 
removed tuait. Ladders created by the action of catalysts that 
remove more than one monomer unit from a biopolymer at a time 
also provide useful information. For example, digesting a 
protein with trypsin will leave either lysine or arginine at 
the carboxy* terminus. When the resulting peptide is digested 
with a di-peptidase in the methods of this invention, the mass 
of the first two units removed will ec[ual the mass of arginine 
or lysine and the mass of the penultimate amino acid, thus 
helping to yield its identity. The identity of several amino 
acids at either end of a protein often will suffice to prepare 
primers for PGR useful in amplifying a gene encoding the 
protein. Even incomplete sequence information can supplement 
information from other sources. 

Accordingly, this invention is also directed to 
methods of obtaining sequence information about a biopolymer. 
The methods involve preparing a biopolymer ladder comprising a 
collection of biopolymers in a nested set according to the 
methods of this invention, determining the differences in mass 
between the members, and determining the identity of units 
whose masses correlate with the determined mass differences. 
The difference in mass between members of the ladder aids in 
identifying the removed units. 

According to one embodiment, this invention provides 
a method for determining the sequence of amino acids in a 
polypeptide. The method involves creating a polypeptide 
ladder having a nested set of peptides in which each of the 
peptides differs in length from another peptide in the set by 
one amino acid, determining the difference in mass between the 
peptides in the nested set, and determining the identity of 
amino acids whose masses correlate with the determined mass 
differences. The identity of the amino acids* in the nested 
set indicates the sequence of amino acids in the peptide. 
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This method does not recjuire determining the amino acid 
sequence of the entire polypeptide. Partial sequences are 
also useful. 

This invention contemplates several methods of 
determining mass differences between members of a biopolymer 
ladder. According to one embodiment of this invention, the 
mass differences between members of the ladder are determined 
by mass spectrometry. Methods for mass spectrometry include, 
for example, ^"Cf plasma desorption, electrospray ionization, 
fast ion bombardment, chemical induction decay and, in 
particular, matrix*assisted laser desorption mass spectrometry 
("MAIiDI-MS") . Methods of performing mass spectrometry on 
biomolecules are well known in the art . 

Apparati and methods for performing MALDI-MS are 
described in International Publication WO 93/24835 (United 
States patent application 08/341,555), United States patent 
5,288,644, R. Beavis and B. Chait (1990) PNAS 87:6873-6877, 
B. Chait and K. Standing (1981) Xnt. J*. Mstss Spectrom, Ion 
Phys. 40:185 and Mamyrin et al. (1973) Sov. Phys. CTETP 37:45, 
all incorporated herein by reference for all purposes. 
Briefly, the frequency tripled output of, e.g., a Q-switched 
Lumonics Hy400 neodymium/yttrium aluminum garnet laser ("Nd- 
YAG") (355 nm, 10-nsec output pulse) is focused by a lens (12- 
inch focal length) through a fused silica window onto a sample 
inside the mass spectrometer. The product ions formed by the 
laser are accelerated by a static electric potential of 3 0kV. 
The ions then drift down a 2-m tube maintained at a vacuum of 
30 ^Pa and their arrival at the end of the tube is detected 
and recorded using, e.g., a Lecroy TR8828D transient recorder. 
The transient records of up to 200 individual laser shots are 
summed together and the resulting histogram is plotted as a 
mass spectrum. Peak centroid determinations and data 
reduction can be performed using a VAX workstation or other 
computer system. 

Protein samples can be prepared for laser desorption 
analysis by the following procedure. The laser desorption 
matrix material that preferably absorbs above 3 00 nm is 
dissolved in aqueous 30% (vol/vol) acetonitrile containing 
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0.1% (wt/wt) trif luoroacetic acid, to make a standard solution 
at 20* C (-50 mM) . A preferred matrix is of -cyano- 4 -hydroxy 
cinnamic acid. A solution containing the protein sample of 
interest is then added to the matrix solution to give a final 
protein concentration of 0.1-10 fiM. A small aliquot (0.5/zl) 
of this mixture is then applied to a flat metal probe tip (2- 
mm diameter) and dried at room temperature with forced air. 
The resulting deposit is washed in 4* C distilled water by 
immersing the tip for 10 seconds. This washing step aids the 
removal of soluble ionic contaminants from the protein/matrix 
deposit, without removing the proteins or matrix. Once the . 
sample is washed, it is inserted into the mass spectrometer 
and analyzed. The entire protocol, from the beginning of 
sample preparation to finished mass spectral analysis, takes 
about 15 minutes. 

Another method useful for determining the difference 
in mass of members of biopolymer ladder is acrylamide gel 
electrophoresis . 

We determined the sequence of synthetic porcine 
renin substrate tetradecapeptide (Sigma) using 
carboxypeptidase Y. The peptide has the amino acid sec[uence 
DRVYIHPFHL LVYS [SEQ ID NO:ll (using the one letter code for 
amino acid residues) and molecular weight of 1759.0 Da. 
Figure 3 is the matrix-assisted laser desorption ionization 
mass spectrometric (MALDI-MS) spectrum of renin substrate 
tetradecapeptide. The MALDI-MS analysis was carried out on a 
laser desorption ionization time-of -flight mass spectrometer 
constructed at the Rockefeller University (R. Beavis and B. 
Chait (1990) Anal. Chem. 62:1836-1840). a-Cyano-4- 
hydroxycinnamic acid was used as the laser desorption matrix 
(Beavis et al . (1992) Org. Mass Spectrom. 27:156-158) in the 
form of saturated solution in water/acetonitrile (2:1, v/v) . 
One ^1 of peptide solution was mixed with 5 /il of matrix 
solution. 0.5 fil of sample solution was then applied on the 
sample probe and analyzed (Fig. 3) . In this spectrum, the 
singly (MH+) and doubly (2+) protonated molecular ions were 
observed at m/z 1759.9 and 880.5 amu. The peak denoted as 
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"d-Asp" (at m/z 1644.7 amu) is an amino- terminal truncated 
peptide impurity. 

The carboxy- terminal sequence of this peptide was 
obtained by using carboxypeptidase digestion in the presence 
of an excess amount of amino acid amide followed by or 
concurrently with matrix-assisted laser desorption ionization 
mass spectrometric data analysis (Hillenkamp et al. (1991) 
Anal, Chem. 63 :1193-1202) • The amino acid amide L-lysinamide 
was used to balance the fast digestion steps of the 
carboxypeptidase during the digestion and to eliminate the 
sequence gaps. The competitive carboxypeptidase Y sequencing 
reaction was carried out in the presence of an excess amount 
of lysinamide (1 M, L-Lysinamide dihydrochloride '(Sigma) ) at 
pH 7.5. 

Five /il of peptide substrate (1 mM) and 5 ^1 of 
carboxypeptidase Y (100 (g/ml) were incubated in the buffer 
solution (total volume 100 /il, consisting of 50 mM of Hepes, 5 
mM of EDTA cuid 1 M of lysinamide) at 37**C for 20 minutes. The 
reaction was then stopped by adding 20 /xl of 2% TFA to 10 fil 
of reaction solution. One fil reaction mixture solution was 
mixed with 5 |il MALDI matrix and 0.5 ^tl sample solution was 
analyzed by MALDI-MS. The resulting sequencing spectrum is 
shown in Figure 4 . The spectrum provides the carboxyl 
terminal sequence of up to seven amino acid residues of this 
polypeptide in a single operation. 

The amino acid sequence was determined as follows . 
The first carboxy- terminal amino acid residue was determined 
by adding 127 Da ( L-Lysinamide residue mass) to the mass of 
the intact molecular ion (1760 Da) , to give 1887Da, followed 
by siibtraction of the highest mass of lysinamide modified 
peptide peak (1799.3 Da) . This calculation gives a mass 
difference of 87.7 Da^ which corresponds to a serine residue. 
The remaining carboxy- terminal sequence can be read out from 
the mass differences between adjacent peaks in the spectrum. 
The mass difference of 126.9 Da between the peaks at m/z 
1424.5 and 1297.6 corresponds to the lysinamide. The peaks on 
the lower mass side of the lysinamide residife correspond to 
the unmodified peptides and the peaks on the higher mass side 
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of lysinamlde correspond to the lysinamide modified peptides. 

For contparisoii/ the same peptide was digested with 
carboxypeptidase Y in the absence of lysinamide. Figure 5 
shows the sequencing spectrum after 10 minutes reaction at 
5 370c. The spectrum contains several gaps which lack an 

intermediate member of the peptide ladder. These results 
clearly indicate that the method of this invention balances 
the fast hydrolysis reactions with the aminolysis reactions. 
As a result, the sequence gaps in the normal carboxypeptide Y 

10 digestion (Figure 5) were filled up by the lysinamide 
modifications (Figure 4) . 

L- Lysinamide is used as a nucleophile and 
competitive substrate for peptide substrate hydrolysis by 
carboxypeptidase Y, which decreases the hydrolysis rate. The 

15 aminolysis product generated by formation of lysinamide at the 
C-terminus of peptide can slow down the hydrolysis rate. This 
is shown by the large amount of intact peptide remaining (Fig. 
4) . This reduced rate of hydrolysis is not a disadvantage. 
It provides for more control of the degradation/ladder- forming 

20 process. The rate can be increased by the simple addition of 
more enzyme. 

The present invention provides a substantially novel 
method for preparing biopolymer ladders . While specific 
examples have been provided, the above description is 

25 illustrative and not restrictive. Msuiy variations of the 

invention will become apparent to those of skill in the art 
upon review of this specification. The scope of the invention 
should, therefore, be determined not with reference to the 
above description, but instead should be determined with 

30 reference to the appended claims along with their full scope 
of equivalents. 

All publications and patent documents cited in this 
application are incorporated by reference in their entirety 
for all purposes to the same extent as if each individual 

35 publication or patent document were so individually denoted. 
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(2) INFORMATION FOR SBQ ID NO:l: 

(i) SEQUENCE CH ARACTERISTICS : 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Asp Arg Val Tyr lie His Pro Phe His Leu Leu Val Tyr Ser 
15 10 



wo 9d/36732 



19 



PCTAJS96/06938 



Wm^T IS CIAIMED IS : 



1 1. A method for producing a biopoiymer ladder 

2 comprising contacting an ensemble of biopoiymer molecules with 

3 (i) a terminating reagent that reacts with the 

4 biopoiymer molecules to add terminating moieties and 

5 (ii) a cleavage reagent that removes terminal 
€ units from an end of the biopoiymer molecules; 

7 wherein the rate for removing terminal \inits from biopoiymer 

8 molecules bearing a terminating moiety is less than the rate 

9 for removing terminal units from biopoiymer molecules that do 

10 not bear a terminating moiety, the biopoiymer molecules being 

11 contacted with the cleavage and terminating reagents under 

12 conditions and in amounts so as to create a collection of 

13 biopoiymer molecules having at least two members in a nested 

14 set, 

1 2. The method of claim 1 wherein the cleavage 

2 reagent catalyzes the addition of terminating moieties to the 

3 biopolymers molecules. 

1 3 . The method of claim 2 wherein the rate at which 

2 the cleavage reagent catalyzes removal of non- terminated units 

3 from the biopoiymer molecules is greater than the rate at 

4 which it catalyzes the addition of terminating moieties . 

1 4 . The method of claim 3 wherein the biopoiymer 

2 molecules are peptides. 

1 5 . The method of claim 4 wherein the cleavage 

2 reagent comprises an exopeptidase . 

1 6. The method of claim 5 wherein the exopeptidase 

2 is a carboxypeptidase. 

1 7 . The method of claim 6 wherein the 

2 carboxypeptidase is carboxypeptidase A, carbbxypeptidase 

3 carboxypeptidase Y or carboxypeptidase B. 
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1 8. The method of claim 5 wherein the exopeptidase 

2 is an aminopeptidase . 

1 9. The method of claim 8 wherein the 

2 aminopeptidase is leucine aminopeptidase or microsomal 

3 peptidase. 

1 10. The method of claim 5 wherein the exopeptidase 

2 is a poly-peptidase . 

1 11. The method of claim 6 wherein the terminating 

2 reagent is an amino acid amide. 

1 12 . The method of claim 11 wherein the amino acid 

2 amide is L-lysinamide. 

1 13. The method of claim 1 wherein the terminating 

2 reagent comprises a mixture of isotopes. 

1 14 - The method of claim 3 wherein the biopolymer 

2 molecules are nucleic acids or complex carbohydrates. 

3 15 . The method of claim 1 wherein the cleavage 

4 reagent is thermostable. 

5 16 . The method of claim 15 wherein the cleavage 

6 reagent is a thermostadDle carboxypeptidase. 

7 17. The method of claim 16 wherein the thearmostable 

8 carboxypeptidase is carboxypeptidase Tag. 

1 18. The method of claim 15 wherein the cleavage 

2 reagent is a thermostable aminopeptidase. 

1 19, The method of claim 18 wherein the thermostable 

2 aminopeptidase is an aminopeptidase foimd in Bacillus 

3 stearothexwophilus, Streptococcus thermophilus , Streptococcus 

4 sallvarius subsp. thermophllus , or Sulfolobus solfatstrlcus. 



I 
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1 20. The method of claim 15 wherein the conditions 

2 comprise contacting the ensemble of biopolymer molecules with 

3 the thermostable cleavage reagent at a temperature between 

4 about 70° C and about 100^ C. 

1 21. A biopolymer ladder comprising a collection of 

2 biopolymer molecules having at least two members in a nested 

3 set wherein the members bear the same terminating moiety and 

4 wherein the terminating moiety is added catalytically . 

1 22. The biopolymer ladder of claim 21 wherein the 

2 biopolymer molecules are peptides. 

1 23 . The biopolymer ladder of claim 22 having at 

2 least three members in a nested set. 

1 24 . The biopolymer ladder of claim 23 wherein the 

2 terminating moiety is an amino acid amide. 

1 25. The biopolymer ladder of claim 24 wherein the 

2 terminating moiety is L-lysinamide . 

1 26. The biopolymer ladder of claim 23 wherein the 

2 terminating moiety is an n-acetyl amino acid. 

3 27. The biopolymer ladder of claim 26 wherein the 

4 n-acetyl amino acid is n-acetyl lysine. 

1 28. The biopolymer ladder of claim 21 wherein the 

2 biopolymer molecules are nucleic acids. 

1 29. A method of obtaining information sibout the 

2 structure of a biopolymer comprising the steps of: 

3 (a) preparing a biopolymer ladder comprising a collection 

4 of biopolymer molecules having at least two members in a 

5 nested set according to the method of claim 1, 

6 (b) determining the differences in mass* between the 

7 members , and 



wo 96^6732 



22 



PCTAJS96/06938 



8 (c) determining the identity of units whose masses 

9 correlate with the determined mass differences. 

1 30, The method according to claim 29 wherein said 

2 step of determining differences in mass comprises the steps of 

3 depositing the biopolymer molecules in a matrix; and applying 

4 a laser to desorb the biopolymer molecules from the matrix. 

1 31. A method of obtaining information about the 

2 sequence of amino acids in a polypeptide comprising the steps 

3 of : 

4 a) preparing a polypeptide ladder by steps comprising 

5 contacting an ensemble of polypeptides with . 

6 (i) a terminating reagent that reacts with the 

7 polypeptide molecules to add terminating moieties and 

8 (ii) a cleavage reagent that cleaves a terminal 

9 peptide bond of polypeptides thereby removing a terminal amino 

10 acid; wherein the rate for removing terminal units from 

11 polypeptide molecules bearing a terminating moiety is less 

12 them the rate for removing terminal amino acids from 

13 polypeptides that do not bear a terminating moiety, the 

14 polypeptides being contacted with the cleavage and terminating 

15 reagents under conditions and in amounts so as to create a 

16 collection of polypeptides having at least two members in a 

17 nested set; 

18 b) determining the differences in mass between the 

19 members; and 

20 c) determining the identity of amino acids whose masses 

21 correlate with the determined mass differences, whereby the 

22 order of mass differences in the biopolymer ladder indicates 

23 the identity of the amino acids indicates the sec[uence of 

24 amino acids in the polypeptide* 

1 32. The method of claim 31 wherein the cleavage 

2 reagent catalyzes the addition of terminating moieties to the 

3 polypeptide molecules. 
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1 33 . The method of claim 32 wherein the rate at 

2 which the cleavage reagent catalyzes removal of non- terminated 

3 units from the biopolymer molecules is greater than a rate at 

4 which it catalyzes the addition of terminating moieties to 

5 terminal units of biopolymer molecules. 

1 34 . The method as recited in claim 33 wherein said 

2 cleavage reagent conprises a carboxypeptidase. 

1 35. The method as recited in claim 34 wherein said 

2 terminating reagent comprises an amino acid amide. 

1 36. The method as recited in claim 35 wherein said 

2 cleavage reagent comprises carboxypeptidase-Y and said 

3 terminating reagent comprises L-Lysinamide . 

1 37. The method as recited in claim 33 wherein said 

2 cleavage reagent comprises a mixture of carboxypeptidases 

3 wherein rate of cleavage of peptide bonds is more homogenous 

4 among amino acids than the rate of cleavage using any one of 

5 the carboxypeptidases alone. 

1 38. The method of claim 33 wherein the step of 

2 determining the differences in mass comprises subjecting the 

3 polypeptide ladder to mass spectrometry. 

1 39, The method of claim 38 wherein the method of 

2 mass spectrometry is matrix-assisted laser desorption/ 

3 ionization mass spectrometry. 

1 40. The method of claim 38 wherein the polypeptide 

2 has a mass below about 5 kDa. 

1 41. The method of claim 38 wherein the polypeptide 

2 has a mass below about 3 . 5 kDa . 

1 42 . The method of claim 31 wherein the cleavage 

2 reagent is thermostable. 
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43 . The method o£ claim 42 wherein the cleavage 
reagent is a thermostable carboxypeptidase . 

44 . The method of claim 43 wherein the theirmostable 
carboxypeptidase is carboxypeptidase Tag. 

45. The method of claim 42 wherein the cleavage 
reagent is a thermostcddle aminopeptidase. 

46. The method of claim 45 wherein the thermostable 
aminopeptidase is an aminopeptidase from Bacillus 

s t earo thennophi I us , Strep tococcus thermophi lus, S trep tococcus 
salivaxius eubsp. thermophilus, or Sulfolobus Bolfatarlcus. 

47. The method of claim 42 wherein the conditions 
comprise contacting the ensemble of biopolymer molecules with 
the thermostable cleavage reagent at a temperature between 
about 70° C and about 100 C. 
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